HIGHLY-EFFICIENT, SELF-INACTIVATING,
RECOMBINATION-FREE, U3-FREE RETROVIRAL VECTORS 

Background of the invention 
Field of the Invention

This invention relates to recombination-free, highly-efficient 
retroviral vectors that self-inactivate after one round of retroviral replication. The 
present vectors allow cell-type specific gene expression from tissue specific 
promoters and enhancers. The recombination-free retroviral vectors eliminate risks 
of downstream activation of cellular proto-oncogenes (tumorigenesis) that can occur 
with conventional vectors. This invention also relates to a retrovirus produced by 
the recombinant retrovirus vector and a host cell containing the provirus of the 
present invention. 

Description of the Background 

The disclosures referred to herein to illustrate the background of the 
invention and to provide additional detail with respect to its practice are 
incorporated herein by reference. For convenience, the disclosures are referenced 
in the following text and respectively grouped in the appended bibliography. 

Retroviral vectors are the most efficient tools to introduce genes into
vertebrate cells. Clinical experiments have been conducted using retrovirus vectors
to cure a genetic disease in humans (adenosine deaminase (ADA) deficiency).
Besides correcting inborn errors of metabolism, gene therapy is also being tested in
clinical trials to cure cancer and various other diseases (Science 1992, Vol. 258, pp.
744-746).

Retroviral vectors are basically retroviral particles that contain a
crippled viral genome in which all viral protein coding sequences have been
replaced with the gene(s) of interest. As a result, such viruses cannot further
replicate after one round of infection without the help of a helper virus. Retroviral
vector particles are produced by helper cells (Figure 1). Such helper cells contain
plasmid constructs which express all retroviral proteins necessary for particle
production and replication. After the introduction (transfection) of the retroviral
vector genome into such helper cells, the vector genome (an RNA genome) is
encapsidated into virus particles (due to the presence of specific encapsidation
sequences). Virus particles are released from the helper cell carrying a genome
containing only the gene(s) of interest (Figure 1). After infection of a fresh target
cell, the RNA genome is reverse transcribed into DNA. The DNA copy is
integrated into the host genome (Figure 2A). The integrated viral DNA is called
the provirus. In the last decade, several retroviral vector systems, derived from
chicken or murine retroviruses, have been developed for the expression of various
genes (for reviews see Temin, 1987; Gilboa, 1990).

Retroviral vectors have several limitations. For example, one major
concern is the possible activation of proto-oncogenes as a result of the integration of
the vector into the target cell genome. The activation of proto-oncogenes, which is
mediated by the viral LTR (long terminal repeat) promoter and enhancer, can
lead to the malignant transformation (cancer) of the infected cell. Another problem
with current retroviral vectors is the regulation of gene expression. Due to the presence of
the retroviral LTR promoter and enhancer, it is impossible to design reliable vectors
for tissue specific gene expression.

To avoid these two shortcomings, retroviral vectors have been
developed from murine leukemia virus (MLV) and spleen necrosis virus (SNV) that
self-inactivate after one round of retroviral replication (Yu et al., 1986; Dougherty
and Temin, 1987; United States patent no. 4,980,289 to Temin et al.). This has
been achieved by deleting parts or almost all sequences of the retroviral U3 region
of the right LTR (Figure 2B). As a result of the mechanism of the retroviral life
cycle, after one round of replication, a retroviral provirus is formed that does not
contain LTR promoters.

However, in the case of MLV derived vectors the efficiency of gene
transduction was about four orders of magnitude less than that of a vector with two
complete LTRs. In the case of SNV, efficiency of gene transduction decreased
only about tenfold. However, detailed investigations revealed that the U3 region of
the right LTR was reconstituted in the helper cell (Olson et al., 1992). This
reconstitution of the U3 region was the result of recombination (or gene conversion)
of the vector with retroviral sequences present in the helper cell or in the other
LTR.

To prevent such recombination events, new SNV-derived helper cells
were constructed that express retroviral particle proteins from promoters and
enhancers different from those present in the retroviral genome. Such new helper
cells were termed DSN cells. DSN cells contain two separate plasmids for the
expression of gag-pol and env proteins (Dougherty and Temin, 1989). One plasmid
contains the gag-pol gene of SNV, which is expressed from the CMV promoter.
The other plasmid contains the envelope gene unit of SNV, which is expressed from
the RSV promoter. Experiments performed with these new helper cells and SNV
derived U3-minus vectors (e.g., pJD220SVHY, see Figure 3) revealed that the
frequency of the recombination event(s) leading to a vector with two wild-type
LTRs was markedly reduced. However, vectors containing two wild-type LTRs
were still recovered from transfected DSN helper cells (Dougherty et al., 1989).
This finding showed that the (unaltered) U3 region of the left LTR of the vector
construct can also serve as a template for the recombination (or gene conversion)
event resulting in the reconstitution of the U3 region of the right LTR.

Brief Description Of The Figures

FIGURE 1 is a diagram illustrating retroviral helper cells producing 
recombination-free, self-inactivating retroviral vectors. 

FIGURE 2 is a diagram illustrating retroviral vector replication. 

FIGURE 3 is a diagram illustrating retroviral vectors derived from
spleen necrosis virus (SNV).

FIGURE 4 is a diagram illustrating the nucleotide sequences at the
junction of the cytomegalovirus (CMV) with spleen necrosis virus sequences as
present in the pPO111 and pPO115 vector series.



Summary Of The Invention 



The present invention pertains to a recombinant retrovirus vector 
having a U3-free 5' LTR, a partially deleted 3' LTR, all essential cis-acting
sequences for replication, an internal promoter recognizable by a selected host cell, 
and a non-retroviral gene under the control of the recognized promoter wherein: 

(a) the U3-free 5' LTR is positioned 5' of the non-retroviral gene 
and has a transcriptional promoter and enhancer different from that of the original 
retroviral promoter and enhancer replacing the original U3 region of the 5' LTR;

(b) the partially deleted 3' LTR is positioned 3' of the non-retroviral
gene and has no U3 sequences except for those required at the attachment site for 
viral integration; 

(c) an exogenous polyadenylation addition signal sequence
recognized by the selected host cell is positioned on the vector 3' to the 3' LTR
viral integration site; and

(d) the internal recognized promoter is positioned adjacent to the 
non-retroviral gene on the vector to permit expression of the non-retroviral gene in 
the host cell; 

whereby the vector can produce progeny virus in a helper cell with 
the progeny virus being capable of infecting the selected host cell and forming a
provirus in the host cell, with the non-retroviral gene being expressible in the host 
cell, but the provirus in the host cell will be replication incompetent even in the 
presence of a helper virus. 

The present invention also pertains to a retrovirus produced by the
recombinant retrovirus vector and a host cell containing the provirus of the present
invention.

Detailed Description Of The Invention



This invention relates to recombination-free, highly-efficient
retroviral vectors that self-inactivate after one round of retroviral replication. The
vectors allow cell-type specific gene expression from tissue specific promoters and
enhancers and eliminate risks of downstream activation of cellular proto-oncogenes
(tumorigenesis) that can occur with conventional vectors. The present
self-inactivating vectors eliminate the problems of recombination and give rise to high
gene transfer efficiency, up to 1,000 fold higher than that of existing
self-inactivating vectors. The vectors described herein are derived from spleen necrosis
virus, but the same modifications may be made to any retroviral vector, leading to
similar results. Thus, the novel vectors have a high potential for use in human gene
therapy and in other gene transfer applications in which cell-type specific gene
expression is required, e.g., tissue specific gene expression in transgenic animals.

To eliminate vector recombination and improve the efficiency of
gene transduction, two modifications were made to a self-inactivating retroviral
vector. First, the U3 region of the left LTR was replaced by the immediate early
gene promoter and enhancer of the cytomegalovirus (CMV, Figures 3 and 4). This
promoter was shown to be the strongest among a series of promoters tested for gene
expression in a large variety of cell-types, including the SNV promoter in D17 cells.
In order not to abolish retroviral replication, the substitution of the U3 promoter by
the CMV promoter was done in such a way that initiation of transcription started at
the beginning of R as in the wild-type virus. In particular, if RNA transcription is
initiated upstream of the original initiation site, first strand cDNA synthesis would
result in a cDNA product abolishing replication. This cDNA may be transferred to and
hybridize with the R region of the second RNA molecule. However, such a cDNA
most probably could not function as a primer, since the immediate 3' nucleotides of
that cDNA would not find homologous sequences for hybridization. Since the
precise initiation of transcription was difficult to predict, three constructs
(designated as pPO111-R1, pPO111-R2, and pPO111-R3) were made in which
variable amounts of the U3 nucleotides upstream of the first nucleotide of R were
retained (Figures 3 and 4). The function as well as the frequencies of recombination of
these constructs were tested as described earlier (Dornburg and Temin, 1988; Olson
et al., 1992) and as described below. Second, the encapsidation region, as present
in previous SNV derived vectors, was extended. Such additional sequences
increase the efficiency of encapsidation of the vector genome into retroviral
particles (Embretson and Temin, 1987).

In one embodiment of the present invention, the vector is a
recombinant retrovirus vector having a normally replication incompetent retrovirus
gene sequence with a foreign eukaryotic gene. The retrovirus gene sequence is
prepared with a deficiency in the retrovirus promoter so that the vector can still
produce progeny virus in a helper cell, with the progeny virus being capable of
infecting a selected eukaryotic host cell, forming a provirus, and expressing the
eukaryotic gene in the host cell, but the provirus will be defective in the retrovirus
promoter sequence. Preferably, the vector is a recombinant plasmid. In another
embodiment, a foreign internal promoter is positioned adjacent to the foreign
eukaryotic gene on the vector to permit expression of the foreign eukaryotic gene in
the eukaryotic host cell without initiating retroviral provirus gene expression. In
one form, the reading direction of the foreign promoter is inverted relative to the
normal reading direction of the retrovirus gene sequence, and a foreign 3' RNA
processing sequence is positioned on the side of the foreign eukaryotic gene
sequence which is opposite to the foreign promoter. In another embodiment, the
retrovirus is a normally replication incompetent retrovirus of the type having a
retrovirus portion and a foreign eukaryotic portion, the retrovirus portion having a
deficient promoter portion, such that the virus is capable of infecting a eukaryotic
host cell, forming a provirus, and expressing a eukaryotic protein coded for by the
foreign eukaryotic portion in the host cell, but the provirus will be defective in a
retrovirus promoter such that retroviral provirus gene expression does not take place
in the host cell.

The present invention allows one to select a eukaryotic gene of
interest, insert the gene into a vector designed in accordance with the present
invention, transfect a helper cell with the vector, harvest virus stock from the helper
cell, use the harvested progeny virus to infect a target cell, and have the proviruses
which are formed in the target cells express the inserted eukaryotic gene without
expressing any retroviral proteins. Since there is no retroviral promoter that is
active on the provirus, endogenous helper proteins cannot trigger production of a
virus from the provirus. Since there is no retroviral promoter in the provirus, the
provirus cannot provide a retrovirus signal that might trigger the host cell to act in
an unintended way. The lack of a retroviral promoter stops production of retroviral
RNA. This system makes the acceptance of recombinant retroviruses as drugs
for vertebrates much more likely.

In the present invention, the enhancer and the promoter sequences of
the retrovirus present at U3 in SNV have been deleted from the right side of the
DNA sequence in the plasmid vector (3' LTR). Only the sequences required at the
attachment site for viral integration in the 3' LTR are present. The U3-free 5' LTR has
a transcriptional promoter and enhancer different from that of the original retroviral
promoter and enhancer to replace the original U3 region of the 5' LTR. The total
lack of homology in the two U3 sequences prevents recombination. The U3
sequence normally present in the 5' LTR has been reported to be used as a template
to repair a partially deleted U3 sequence in the 3' LTR. By eliminating the U3
sequence in the 5' LTR, the template for repair of the partially deleted U3
sequence in the 3' LTR is also eliminated. The vector DNA is used to transfect
helper cells in a conventional manner. Because transcription from the vector begins
at R on the left side, and because the promoter on the left side is not defective, virus
can be harvested from the transfected helper cell in the conventional manner.
Target cells can then be infected with the harvested virus. Since the right side U3
supplies the coding sequences for both U3 segments in the resulting provirus, the
transcriptional promoter which was originally deleted on one side of the plasmid
DNA shows up as being deleted from both sides in the resulting provirus. The
vector therefore permits a stock of the retrovirus progeny virus to be grown up, yet
will not permit further replication after one infection cycle.
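
The way a deletion in the right-side U3 ends up in both LTRs of the provirus can be illustrated with a small model. The following Python sketch is only an illustration under simplifying assumptions: a construct is reduced to three labelled parts (left U3, body, right U3), R and U5 are left implicit, and the names Construct, transcribe, reverse_transcribe, and one_round are hypothetical and not part of the disclosure.

```python
from typing import NamedTuple, Tuple


class Construct(NamedTuple):
    """Simplified DNA layout: content of the left U3, the body, and the right U3."""
    left_u3: str
    body: str
    right_u3: str


def transcribe(dna: Construct) -> Tuple[str, str]:
    # Transcription starts at R of the left LTR, so the left U3 is not
    # part of the RNA; only the right-side U3 is carried along.
    return (dna.body, dna.right_u3)


def reverse_transcribe(rna: Tuple[str, str]) -> Construct:
    # During reverse transcription the U3 at the 3' end of the RNA is
    # duplicated, so both LTRs of the provirus receive the same U3.
    body, u3 = rna
    return Construct(left_u3=u3, body=body, right_u3=u3)


def one_round(dna: Construct) -> Construct:
    """One round of replication: plasmid in helper cell -> provirus in target cell."""
    return reverse_transcribe(transcribe(dna))


# Illustrative labels only (assumptions, not sequences from the disclosure).
conventional = Construct("SNV pro/enh + att", "gene", "SNV pro/enh + att")
first_generation = Construct("SNV pro/enh + att", "internal pro + gene", "att only")
u3_free = Construct("CMV pro/enh", "internal pro + gene", "att only")

for name, construct in [
    ("conventional", conventional),
    ("first generation U3-minus", first_generation),
    ("U3-free", u3_free),
]:
    print(name, "->", one_round(construct))
```

In this model the conventional vector regenerates two promoter-bearing LTRs, whereas both U3-deleted vectors give a provirus whose LTRs contain only the attachment-site sequences. The model does not capture recombination; it only shows why the CMV promoter placed in the left LTR is absent from the provirus.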

If an internal promoter is not inserted, there will be no promoter to
produce the desired foreign eukaryotic gene expression. Preferably such a
promoter is positioned immediately adjacent to the foreign eukaryotic gene of interest so
that no intervening retrovirus gene RNA is expressed. Because deletion of most of
the U3 in SNV resulted in a loss of correct 3' end processing of viral RNA, even
though AAUAAA was still present, a polyadenylation site was added to the vector.
Moreover, problematic recombination is unlikely because the U3 sequences in the
vector are not homologous.

Abbreviations used in the present invention are as follows: pro -
promoter; enh - enhancer; PBS - primer binding site for DNA synthesis; PPT -
polypurine tract for DNA synthesis; E - encapsidation sequences for RNA
packaging; attR+ - a sequence that will form the right side of the attachment site
relating to integration; attL+ - the sequence that will form the left side of the
attachment site relating to integration; attL- - the deletion of the original provirus
left-side attachment site; and attR- - the deletion of the original right side
attachment site.

The term "oligonucleotide" as used herein refers to primers, probes,
oligomer fragments to be detected, oligomer controls, and unlabeled blocking
oligomers. Oligonucleotides are molecules comprised of two or more
deoxyribonucleotides or ribonucleotides. The term "primer" as used herein refers
to an oligonucleotide, preferably an oligodeoxyribonucleotide, either naturally
occurring such as a purified restriction digest or synthetically produced, which is
capable of acting as a point of initiation of synthesis when subjected to conditions in
which synthesis of a primer extension product, which is complementary to a nucleic
acid strand, is induced, i.e., in the presence of nucleotides, an agent for
polymerization such as a DNA polymerase, and a suitable temperature and pH.
The primer must be sufficiently long to prime the synthesis of extension products in
the presence of the polymerization agent.

FIGURE 1 is a diagram illustrating retroviral helper cells producing
recombination-free, self-inactivating retroviral vectors. FIGURE 1A shows cells
(preferentially D17 cells = canine cells) transfected with two different plasmids
expressing the retroviral gag-pol and env genes, respectively. These genes are
expressed from promoters and enhancers different from the vector long terminal
repeat (LTR) promoter and enhancer. FIGURE 1B shows that after the transfection
of the retroviral vector genome into such helper cells, viral particles are produced
that contain the RNA vector genome with the gene of interest.

FIGURE 2 is a diagram illustrating retroviral vector replication.
FIGURE 2A shows an example of a conventional retroviral vector provirus as
present in the helper cell shown at the top. This vector contains wild-type long
terminal repeats with full-length U3, R, and U5 regions. RNA transcription yields
the RNA transcript shown below. This RNA contains an encapsidation sequence
(E) and all other cis-acting sequences necessary for replication (pbs = primer
binding site, ppt = polypurine tract). After infection of a fresh target cell, the
retroviral genome is reverse transcribed into a double-stranded DNA copy. During
this process, the U3 region, which contains the retroviral promoter and enhancer and
which is present at the 3' end of the RNA genome, is duplicated and attached to the
5' end of the DNA copy. In a similar way, the U5 region, present only at the 5'
end of the RNA genome, is also duplicated and attached to the 3' end of the DNA
copy. The double-stranded DNA copy is integrated into the genome of the infected
cell. Due to the presence of two long terminal repeats which contain the retroviral
enhancer and promoter, tissue specific gene expression cannot be obtained.
Further, the downstream enhancer and promoter can activate expression of
genes located downstream of the integrated provirus (indicated by an arrow).
FIGURE 2B illustrates the principle of a self-inactivating retroviral vector. The
vector present in the helper cell is shown at the top. This vector is almost identical
to the one shown above. However, the long terminal repeat at the right side has
been modified in the following way: most of the U3 sequences, including the promoter
and enhancer sequences, have been deleted. Maintenance of some U3 sequences
that form the attachment site (att) is essential for the efficient integration of the
vector genome. The RNA transcribed from the U3-minus vector virus does not
contain the retroviral promoter and enhancer. Thus, after one round of retroviral
replication, a promoter-less retroviral provirus is formed. Genes can be expressed
from internal tissue-specific promoters (pro).

FIGURE 3 is a diagram illustrating retroviral vectors derived from
spleen necrosis virus (SNV). The SNV genome (provirus) is shown at the top. An
example of a standard retroviral vector derived from SNV is shown below (pJD214HY).
In all SNV-derived standard vectors, the protein coding regions (gag-pol and env,
the region from SalI to the 3' end of env) have been replaced with the gene(s) of
interest (in pJD214HY with the hygromycin resistance gene, referred to as hygro).
pJD220SVHY is a first generation self-inactivating retroviral vector in which the
retroviral promoter and enhancer of the right LTR have been deleted (designated as
U3 minus; for more details, see Figure 2B). To increase efficiency of replication,
the polyadenylation sequence of simian virus 40 (SV40ter) has been inserted
downstream of the U3-minus LTR. To avoid the recombination problems of first
generation U3-minus vectors, in the pPO111 vectors the U3 region of the left LTR
has been almost completely deleted (except a few nucleotides to guarantee correct
initiation of transcription) and replaced by the immediate early promoter and
enhancer of the human cytomegalovirus (CMV, for more details, see also Figure
4). Further, this second generation, U3-free vector contains additional retroviral
sequences (from SalI to EagI) to increase the efficiency of encapsidation. A
universal, second generation U3-free vector (pPO115-R1) contains a multiple
cloning site replacing the promoter (SV40pro) and hygromycin B resistance gene
(hygro) as present in pJD220SVHY and pPO111-R1 (from EagI to ClaI). The
multiple cloning site has been derived from pBluescript II (EagI to ClaI) and
allows the easy insertion of various genes and promoters to give new retroviral
vectors.

FIGURE 4 is a diagram illustrating the nucleotide sequences at the
junction of the cytomegalovirus (CMV) with spleen necrosis virus sequences as
present in the pPO111 and pPO115 vector series. An excerpt of the SNV genome
showing the TATA box and initiation of transcription is shown at the top. The
SacI site downstream of the TATA box of the CMV immediate early promoter
(shaded sequence) was used for cloning to connect the CMV promoter and enhancer
to the SNV sequences. pPO111-R2 and pPO111-R3 are 3 or 9 nucleotides shorter
than pPO111-R1, respectively.
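
The position of a cloning site relative to the TATA box can be checked directly on a sequence. The following Python sketch is a minimal illustration, assuming that the 65-nucleotide excerpt below (copied from the pPO111-R1 sequence listing in this document) spans the CMV/SNV junction, and using the standard SacI recognition sequence GAGCTC. The helper name find_sites is hypothetical.

```python
from typing import List

# Excerpt copied from the pPO111-R1 sequence listing (one printed line).
# Treating it as the CMV/SNV junction region is an assumption.
JUNCTION = "gaggtctatataagcagagctcatctcttgctcggggtcgccgtcctgcacattgttgttgtgac"

SACI_SITE = "gagctc"   # SacI recognition sequence (GAGCTC)
TATA_BOX = "tatataa"   # TATA motif searched for in the excerpt


def find_sites(sequence: str, site: str) -> List[int]:
    """Return all 0-based start positions of `site` in `sequence` (case-insensitive)."""
    sequence = sequence.lower()
    site = site.lower()
    positions = []
    start = sequence.find(site)
    while start != -1:
        positions.append(start)
        start = sequence.find(site, start + 1)
    return positions


tata_positions = find_sites(JUNCTION, TATA_BOX)
saci_positions = find_sites(JUNCTION, SACI_SITE)
print("TATA box at:", tata_positions)   # [6]
print("SacI site at:", saci_positions)  # [16]
```

In this excerpt the single SacI site begins 10 nucleotides after the start of the TATA motif. The same helper could be applied to a complete, verified sequence to list every SacI or other restriction site.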

The function of the new vectors was tested in a tissue culture system
as follows and as described earlier (Olson et al., 1992). The new vector constructs
pPO111-R1, pPO111-R2, and pPO111-R3 were transfected into helper cells. In
parallel, the retroviral vectors pJD214HY, a standard retroviral vector with two
complete wild-type LTRs, and pJD220SVHY, the first generation self-inactivating
vector from which the new vectors have been derived (Figure 3), were also
transfected into helper cell lines. Transfected cells were selected for hygromycin
resistance and cell lines were established. These resulting helper cell lines are
designated as step 1 cells. Virus was harvested from confluent step 1 cultures and
fresh helper cells were infected. The infected helper cells were termed step 2 cells.
Virus titers were determined (Table 1). The results show that the new vectors pPO111-R1
to pPO111-R3 were more efficient than pJD220SVHY. They were almost as
efficient as a vector with two wild-type LTRs (pJD214HY).

To test for recombination, virus particles were harvested from
confluent step 2 cell cultures (mass infection) and fresh D17 cells were infected.
Detection of hygromycin resistant colonies indicates that recombination
reconstituted the U3 region: due to the lack of control sequences in RNA
transcripts derived from completely U3-minus proviruses, such vectors are not
further passaged by retroviral proteins. Thus, only vectors with complete, repaired
LTRs are transferred to new target cells. We found that pJD220SVHY was further
passaged with high efficiency, confirming earlier findings (Olson et al., 1992). The
new vectors pPO111-R1 to R3, however, were not further transferred from step 2
helper cells (Table 1). These data show that the new vectors are resistant to
recombination events that reconstitute the LTR region.

TABLE 1

EFFICIENCY OF INFECTIVITY AND RECOMBINATION OF SELF-
INACTIVATING RETROVIRAL VECTORS DERIVED FROM SNV

                     VIRUS TITER (CFU)

VECTOR NAME      1. INFECTION      2. INFECTION

pJD214HY         6 x 10^5          5 x 10^6
pJD220SVHY       8 x 10^4          1 x 10^3
pPO111-R1        4 x 10^5          0*
pPO111-R2        2.5 x 10^5        0*
pPO111-R3        1.5 x 10^5        0*

* No hygromycin resistant colonies were obtained with pPO111 vectors.

The retroviral vectors (see also Figure 3) were transfected into helper
cells followed by hygromycin selection. Virus was harvested from confluent
cultures and fresh helper cells were infected (referred to as first infection). Infected
cells were selected for hygromycin resistance, and virus titers were determined
(expressed as colony forming units (CFU) per ml of supernatant medium). Cell
lines were established from infected tissue culture plates which contained more than
1,000 independent hygromycin resistant colonies. Virus was harvested from
confluent cell cultures and fresh D17 cells were infected (referred to as second
infection). Infected cells were selected for hygromycin resistance and virus titers
were determined.
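
The relative efficiencies implied by Table 1 can be worked out with simple arithmetic. The following Python sketch is a minimal illustration that assumes the titers as read from Table 1 (CFU per ml) and assumes that the third pPO111 row of the table refers to pPO111-R3. The helper name fold_over is hypothetical.

```python
# Titers as read from Table 1: (first infection, second infection), CFU per ml.
# The label "pPO111-R3" for the last row is an assumption.
TITERS = {
    "pJD214HY": (6e5, 5e6),
    "pJD220SVHY": (8e4, 1e3),
    "pPO111-R1": (4e5, 0.0),
    "pPO111-R2": (2.5e5, 0.0),
    "pPO111-R3": (1.5e5, 0.0),
}


def fold_over(vector: str, reference: str, infection: int = 0) -> float:
    """Ratio of the titer of `vector` to that of `reference`.

    infection=0 selects the first infection, infection=1 the second.
    """
    return TITERS[vector][infection] / TITERS[reference][infection]


for name in ("pPO111-R1", "pPO111-R2", "pPO111-R3"):
    vs_sin = fold_over(name, "pJD220SVHY")
    vs_wild_type = fold_over(name, "pJD214HY")
    print(f"{name}: {vs_sin:.2f}x pJD220SVHY, {vs_wild_type:.2f}x pJD214HY")
```

With these numbers, the first-infection titer of pPO111-R1 is 5 times that of pJD220SVHY and about two thirds of that of pJD214HY. The second-infection column is not used for ratios because the pPO111 vectors gave no colonies.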

Examples 

Materials and Methods 

Plasmid construction. 

The immediate early promoter and enhancer of cytomegalovirus,
CMV (a 900 bp HindIII fragment obtained from plasmid pJDCMV19SV, Dougherty
et al., 1990) was cloned into plasmid pRD15 (described in Sheay et al., submitted
for publication). The resulting plasmid was termed pPO101. A DNA fragment
containing most of the LTR, the encapsidation region, and parts of the gag-pol
region of spleen necrosis virus (a BalI fragment derived from plasmid pSW253,
described by Watanabe and Temin, 1982) was cloned into the blunted SacI site of
pBluescript II KS. The resulting plasmid was termed pPO102. Plasmid pPO102
was digested with SacI and re-ligated after treatment with Klenow polymerase I
(creation of blunt ends) to eliminate a SacI site present in the encapsidation
sequence. The resulting plasmid was termed pPO102-S. Using PCR technology
and plasmid pPO102-S as template, three DNA fragments were generated
comprising slightly different amounts of the SNV R, U5, and encapsidation region:
a DNA fragment termed R1 comprises map units 383 to 986 of the SNV genome,
fragment R2 comprises map units 385 to 986, and fragment R3 comprises map units
392 to 986. All PCR primers contained SacI recognition sites at their 5' ends.
After SacI digestion, fragments R1, R2, and R3 were cloned into the SacI site of
pBluescript II KS. The resulting plasmids were termed pPO103-R1, pPO103-R2,
and pPO103-R3, respectively. Fragments R1 to R3 were isolated from these
plasmids and cloned into the SacI site of pPO101. The resulting plasmids were
termed pPO104-R1, pPO104-R2, and pPO104-R3, respectively.

Plasmid pJD220SVHY (Dougherty and Temin, 19986) was digested
with SspI and SacI. After Klenow pol I treatment, a linker coding for the
recognition site of the restriction enzyme BglII was ligated. This procedure
removed the left LTR promoter and all of the encapsidation sequences present in
this vector. The resulting plasmid was termed pPO106. DNA fragments derived
from plasmids pPO104-R1, pPO104-R2, and pPO104-R3 comprising the CMV
promoter, R, U5, and the encapsidation sequence (BglII fragments) were isolated
and inserted into plasmid pPO106 digested with BglII. The resulting plasmids were
termed pPO111-R1, pPO111-R2, and pPO111-R3, respectively. The universal
retrovirus vectors pPO115-R1 and pPO115-R2 were made by replacing the SV40
promoter and hygromycin B phosphotransferase gene of pPO111-R1 and
pPO111-R2 (EagI to ClaI fragment) with the multiple cloning site (EagI to ClaI) of
pBluescript II KS.

DNA sequence pPO111-R1 5528 b.p. complete sequence;

cagatctgagcttgccattgcatacgttgtatccatatcataatatgtacat7tatattggctca 

7g7ccaaca77accccca7g7tgaca7tgattattgactagtta7taa~ag7aa7caattacggg 
gtcattagttcatagcccatatatggagttccgcgttacataacttacggtaaatggcccgcctg

gctgaccgcccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgcca 

atagggactttccattgacgtcaatgggtggagtatttacggtaaactgcccacttggcagtaca 

tcaagtgtatcatatgccaagtacgccccctattgacgtcaatgacggtaaatggcccgcctggc 

attatgcccagtacatgaccttatgggactttcctacttggcagtacatctacgtattagtcatc 
gctattaccatggtgatgccgttttggcagtacatcaatgggcgtggatagcggtttgactcacg

gggatttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaaatcaacggg 

actttccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggtaggcgtgtacggtgg 

gaggtctatataagcagagctcatctcttgctcggggtcgccgtcctgcacattgttgttgtgac 

gtgcggcccagattcgaatctgtaataaaacttttttttttctgaatcctcagattggcagtgag 
aggagattttgttcgtggtgttgcctggcctactgggtgggcgcagggatccggactgaatccgt

agtacttcggtacaacatttgggggctcgtccgggataccctccccatcggcagaggtgccaact 

gcttcttcgaactttcttcgaactccggcgccggtgagttaagtacttgattttggtacctcgcg 

agggtt7gggaggatcggag7gg7ggcgggacgctgccgggaagc7ccacc7ccgctcagcaggg 

gacgccctgacctgagctcgaattcagatcttgtggtatctgattgttgttgagccgtccctaag 
acggtgatactaagtcgtggcttgtgtgtttgtttgttgccttgtgtttgttcgtcgtttgtcga

cagcgccttgcgaattggtgtacccacaccgcgcggcttgcgaataatactttggagagcctttt 

gcctcca6tgtcttccgtctgtactcgtcctcctctccctctccggccgggagagctcgcccggg 

gatcctctagtatggtgcactctcagtacaatctgctctgatgccgcatagttaagccagtatac 

actccgctatcgctacotgactgggtcatggctgcgccccgacacccgccaacacccgctgacgc 
gccctgacgggcttgtcVgctcccggcatccgcttacagacaagctgtgaccgtctccgggagct

gcatgtgtcagaggttttcaccgtcatcaccgaaacgcgcgaggcagctgtggaatgtgtgtcag 

ttagggtgtggaaagtccccaggctccccagcaggcagaagta7gcaaagcatgcatctcaatta 

gtcagcaaccaggctccccagcaggcagaagtatgcaaagcatgcatctcaattagtcagcaacc 

atagtcccgcccctaactccgcccatcccgcccctaactccgcccagttccgcccattctccgcc 
ccatggctgactaattttttttatttatgcagaggccgaggccgcctcggcctctgagctattcc

agaagtagtgaggaggcttttttggaggcctaggcttttgcaaaaagctctagagtcgacctgca 

gcccaagctgatccccgggggcaatgagatatgaaaaagcctgaactcaccgcgacgtctgtcga 

gaagtttctgatcgaaaagttcgacagcgtctccgacctgatgcagctctcggagggcgaagaat 

ctcgtgctttcagcttcgatgtaggagggcgtggatatgtcctgcgggtaaatagctgcgccgat 

ggtttctacaaaga7cgttatgtttatcggcac ttgcatcggccgcgctcccgattccggaagt

GCTTGACATTCGGGAATTCAGCGAGAGCCTGACCTATTGCATCTCCCGCCGTGCACAGGGTGTCA 
CGTTGCAAGACCTGCCTGAAACCGAACTGCCCGCTGT7CTGCAGCCGGTCGCGGAGGCCATGGAT 
GCGA7CGC7GCGCCCGATCTTAGCCAGACGAGCGGG77CGGCCCA77CGGACCGCAAGGAA7CGG 
TCAATACAC7ACATGGCGTGATTTCA7A7GCGCGAT7GCTGA7CCCCA7G7GTATCACTGGCAAA 

C7G7GA7GGACGACACCG7CAGTGCGTCCGTCGCGCAGGC7C7CGATGAGCTGATGC7TTGGGCC

GAGGACTGCCCCGAAGTCCGGCACCTCGTGCACGCGGA7T7CGGC7CCAACAA7G7CCTGACGGA 
CAATGGCCCCA7AACAGCGG7CA77GAC7GGAGCGAGGCGA7GTTCGGGGA77CCCAATACGAGG 
7CGCCAACA7C7TC7TC7GGAGGCCG7GGTTGGCT7G7ATGGAGCAGCAGACGCGC7ACT7CGAG 
CGGAGGCA7CCGGAGCTTGCAGGATCGCCGCGGCTCCGGGCG7ATATGCTCCGCA77 GGTCTTGA 

CCAACTC7A7CAGAGCTTGGTTGACGGCAAT77CGA7GATGCAGCT7GGGCGCAGGG7 CGATGCG

ACGCAATCG7CCGA7CCGGAGCCGGGAC7GTCGGGCG7ACACAAA7CGCCCGCAGAAGCGCGGCC 
G7C7GGACCGA7GGC7GTG7AGAAG7ACTCGCCGATAG7GGAAACCGACGCCCCAGCACTCG7GG 
GATCAGCTTGGCACTCGAATTAATTCTCATGT7TGACAGCTTATCATCGATAAGC7C7AGGTCAA 
7GG7T7GACCAGAACATACAAGAGCAGTGGGGAATG7GGGAGGGCC7CGAGGTCGGGGTCGCCG7 
CC7ACACATTG7TG77G7GACGTGCGGCCCAGAT7CGAATC7GTAA7AAAACCT77TTT7AAC7G
AA7CC7CAGATTGGCAGTGAGAGGAGA7TT7G77CG7GGTGTTGGCTGGCC7AC7GGGTGGGCGC 
AGGGA7CGATCCAGACATGA7AAGA7ACA77GA7GAGTTTGGACAAACCACAAC7AGAATGCAGT 
GAAAAAAA7GCTT7A7TTG7GAAATTTGTGA7GCTA77GCT77A7T7GTAACCAT7ATAAGC7GC 
AA7AAACAAGT7AACAACAACAAT7GCA77CAT77TA7GTT7CAGGTTCAGGGGGAGGTG7GGGA 

GGTTTTTTAAAGCAAGTAAAACCTCTACAAATCAAGCTGATCCCTGCCTCGCGCGTTTCGGTGAT 

GACGGTGAAAACCTCTGACACATGCAGCTCCCGGAGACCGTCACAGCTTGTCTGTAACCGGATCC 
CGGGAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCGCAGCCATGA 
CCCAGTCACGTAGCGATAGCGGAGTGTATACTGGCTTAACTATGCGGCATCAGAGCAGATTGTAC 
TGAGAGTGCACCATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGG 

CGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATC 

AGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGT 

GAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGG 
CTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGG 

ACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGC 

CGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGC 

TGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGT 

TCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACT 
TATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACA 

GAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCT 

GCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTG 

GTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGAT 

CCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGT 
CATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAA 

TCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATC 

TCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGAT 

ACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTC 

CAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTA 
TCCGCCTCCATCCAGTCTATTAATTGTTGCCGGCAAGCTAGAGTAAGTAGTTCGCCAGTTAATAG 

TTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTT 

CATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCG 

GTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGT 

TATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTG 

AGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCA 

ACACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTC 
GGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCAC 
CCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAA 
AATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCA 

ATC 



DNA sequence pPOHl-R2 5525 b.p. complete sequence; 

CAGATCTGAGCTTGCCATTGCATACGTTGTATCCATATCATAATATGTACATTTATATTGGCTCA 

TGTCCAACATTACCGCCATGTTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGG 
GTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTG 
GCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCA 
ATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACA 

TCAAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGC 

ATTATGCCCAGTACATGACCTTATCGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC 
GCTATTACCATGGTGATGCGGTTTTGCCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACG 
GGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGG 
ACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGG 
GAGGTCTATATAAGCAGAGCTCTCTTGCTCGGGGTCGCCGTCCTGCACATTGTTGTTGTGACGTG 
CGGCCCAGATTCGAATCTGTAATAAAACTTTTTTTTTTCTGAATCCTCAGATTGGCAGTGAGAGG 
AGATTTTGTTCGTGGTGTTGCCTGCCCTACTGGGTGGGCGCAGGGATCCGGACTGAATCCGTAGT 
ACTTCGGTACAACATTTGGGGGCTCGTCCGGGATACCCTCCCCATCGGCAGAGGTGCCAACTGCT 
TCTTCGAACTTTCTTCGAACTCCGGCGCCGGTGAGTTAAGTACTTGATTTTGGTACCTCGCGAGG 

GTTTGGGAGGATCGGAGTGGTGGCGGGACGCTGCCGGGAAGCTCCACCTCCGCTCAGCAGGGGAC 

GCCCTGACCTGAGCTCGAATTCAGATCTTGTGGTATCTGATTGTTGTTGAGCCGTCCCTAAGACG 
GTGATACTAAGTCGTGGCTTGTGTGTTTGTTTGTTGCCTTGTGTTTGTTCGTCGTTTGTCGACAG 
CGCCTTGCGAATTGGTGTACCCACACCGCGCGGCTTGCGAATAATACTTTGGAGAGCCTTTTGCC 
TCCAGTGTCTTCCGTCTGTACTCGTCCTCCTCTCCCTCTCCGGCCGGGAGAGCTCGCCCGGGGAT 
CCTCTAGTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGTATACACT 
CCGCTATCGCTACGTGACTGGGTCATGGCTGCGCCCCGACACCCGCCAACACCCGCTGACGCGCC 
CTGACGGGCTTGTCTGCTCCCGGCATCCCCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCA 
TGTGTCAGAGGTTTTCACCGTCATCACCGAAACGCGCGAGGCAGCTGTGGAATGTGTGTCAGTTA 
GGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTC 
AGCAACCAGGCTCCCCAGCAGGCAGAAGTATGCAAACCATGCATCTCAATTAGTCAGCAACCATA 
GTCCCGCCCCTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCA 
TGGCTCACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGA 
AGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTCTAGAGTCGACCTGCAGCC 
CAAGCTGATCCCCGGGGGCAATGAGATATGAAAAAGCCTGAACTCACCGCGACGTCTGTCGAGAA 
GTTTCTGATCGAAAAGTTCGACAGCGTCTCCGACCTGATGCAGCTCTCGGAGGGCGAAGAATCTC 
GTGCTTTCAGCTTCGATGTAGGAGGGCGTGGATATGTCCTGCGGGTAAATAGCTGCGCCGATGGT 
TTCTACAAAGATCGTTATGTTTATCGGCACTTTGCATCGGCCGCGCTCCCGATTCCGGAAGTCCT 

TGACATTGGGGAATTCAGCGAGAGCCTGACCTATTGCATCTCCCGCCGTGCACAGGGTGTCACGT 
TGCAAGACCTGCCTGAAACCGAACTGCCCGCTGTTCTGCAGCCGGTCGCGGAGGCCATGGATGCG 
ATCGCTGCGGCCGATCTTAGCCAGACGAGCGGGTTCGGCCCATTCGGACCGCAAGGAATCGGTCA 
ATACACTACATGGCGTGATTTCATATGCGCGATTGCTGATCCCCATGTGTATCACTGGCAAACTG 
TGATGGACGACACCGTCAGTGCGTCCGTCGCGCAGGCTCTCGATGAGCTGATGCTTTGGGCCGAG 

GACTGCCCCGAAGTCCGGCACCTCGTGCACGCGGATTTCGGCTCCAACAATGTCCTGACGGACAA 
TGGCCGCATAACAGCGGTCATTGACTGGAGCGAGGCGATGTTCGGGGATTCCCAATACGAGGTCG 
CCAACATCTTCTTCTGGAGGCCGTGGTTGGCTTGTATGGAGCAGCAGACGCGCTACTTCGAGCGG 
AGGCATCCGGAGCTTGCAGGATCGCCGCGGCTCCGGGCGTATATGCTCCGCATTGGTCTTGACCA 
ACTCTATCAGAGCTTGGTTGACGGCAATTTCGATGATGCACCTTGGGCGCAGGGTCGATGCGACG 

CAATCGTCCGATCCGGAGCCGGGACTGTCGGGCGTACACAAATCGCCCGCAGAAGCGCGGCCGTC 
TGGACCGATGGCTGTGTAGAAGTACTCGCCGATAGTGGAAACCGACGCCCCAGCACTCGTGGGAT 
CAGCTTGGCACTGGAATTAATTCTCATGTTTGACAGCTTATCATCGATAAGCTCTAGGTCAATGG 
TTTGACCAGAACATACAAGAGCAGTGGGGAATGTGGGAGGGCCTCGAGGTCGGGGTCGCCGTCCT 
ACACATTGTTGTTGTGACGTGCGGCCCAGATTCGAATCTGTAATAAAACCTTTTTTTAACTGAAT 

CCTCAGATTGGCAGTGAGAGGAGATTTTGTTCGTGGTGTTGGCTGGCCTACTGGGTGGGCGCAGG 

GATCGATCCAGACATGATAAGATACATTGATGAGTTTGGACAAACCACAACTAGAATGCAGTGAA 
AAAAATGCTTTATTTGTGAAATTTGTGATGCTATTGCTTTATTTGTAACCATTATAAGCTGCAAT 
AAACAAGTTAACAACAACAATTGCATTCATTTTATGTTTCAGGTTCAGGGGGAGGTGTGGGAGGT 
TTTTTAAAGCAAGTAAAACCTCTACAAATCAAGCTGATCCCTGCCTCGCGCGTTTCGGTGATGAC 

GGTGAAAACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGG 

GAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCGCAGCCATGACCC 
AGTCACGTAGCGATAGCGGAGTGTATACTGGCTTAACTATGCGGCATCAGAGCAGATTGTACTGA 
GAGTGCACCATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGC 
TCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGC 

TCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAG 

CAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTC 
CGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACT 
ATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGC 
TTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGT 

AGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCA 

GCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTAT 
CGCCACTGGCAGCAGCCACTGGTAACACGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAG 
TTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCT 
GAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTA 

GCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCT 
TTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCAT 
GAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCT 
AAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCA 
GCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACG 

GGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCCCGAGACCCACGCTCACCGGCTCCAG 
ATTTATCAGCAATAAACCACCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCC 
GCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTT 
GCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCAT 
TCAGCTCCGGTTCCCAACCATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTT 

AGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTAT 

GGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGT 
ACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGGGACCGAGTTGCTCTTGCCCGGCGTCAACA 
CGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGCAAAACGTTCTTCGGG 
GCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCArrrfl 

gccgcaaaaaagggaataagggcgacacggaaatgttgaatactcatactcttcctttttJwc 
DNA sequence pP0111-R3 5519 b.p. complete sequence; 

CAGATCTGAGCTTGCCATTGCATACGTTGTATCCATATCATAATATGTACATTTATATTGGCTCA 

TGTCCAACATTACCGCCATGTTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGG 
""TTAGTTCATAGCCCATATATCGAGTTCCGCGTTACATAACTTACCGTA ccg 

tcaagtgtatcatatgccmgtacgccccctattgacgtcmtgacggtaaatggcccgJctgg; 

"TATGCCGAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTAC " 5 
GGGATrTCCAAGTCTCCACCCCATTGACGrcA>rGGGAGTTTGTTTT G n cacc JI! T ^ A T "" 

GAGGTCTATATAAGCAGAGCTCTCGGGGTCGCCGTCCTGCACATTGTTGTTGTGACGTGCGGCCC 
TGTTCGTGGTGTTGCCTGGCCTACTGGGTGGGCGCAGGGATCCGGACTGAATCCGTAGTACTTCG 

AACrTTCTTCGAACTCCGGCGCCGGTGAGTTAAGTACTTGATTTTGGTACCTCGCGAfrrTTTrr 
GAGGATCGGAGTGGTGGCGGGACGCTGCCGGGAAGCTCCACCTCCGCTCAGCAGGGGACGC^ 

CTMGTCGTGGCTTGTpGTTTGnTGrTGGCTTGTGTTTGTTCGTCGTTTGTcl^^ 
GCGAATTGGTGTACCCACACCGCGCGGCTTGCGAATAATACTTTGGAGAGCCTTTTGCCTCCACT 

GTATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGTATACACTCCcaA 

GGCTTGTCTGCTCCCGGCATCCGCTTACAGACMGCTGTGACCGTCTCCGGGAGCTG A G "t c 
AGAGGTTTTCACCGTCATCACCGA^CGCGCGAGGCAGCTGTGGAATGrGTGTCAG A GGTCT 

GGAAAGTCCCCAGGCTCCCCAGMGGCAGAAGTATGCAAAGCATGCArCTCA^ATTAGTCAGCAAP 
CAGGCT^ 

A ^ 

rJZll TATGCAGAGGCCGAGGCCGCCT "GCCTCTGA G CTATTCCAGAAGTAGT 

GATCCCCGGGGGCAATGAGATATGAAAMGCCTGMCTCACCGCGACGTCTGTCGAGAAGTTTCT 
GATCGAAAAGTTCGACAGCGTCTCCGACCTGATGCAGCTCTCGGAGGGCGAAGAATCTCGTGCTT 

AAAGATCGTTATGTTrATCGGCACTTTGCATCGGCCGCGCICCCGATTCCGGAAGTGCTTGACAT 

TG "^TTCAGCGAGAGCCTGACCTATTGCATCTCCCGCCGTGCACAGGG^ S 

A CTGCCTGAAACCGAACTGCCCGCTGTTCTGCAGCCGGTCGCGGAGGCCAT olZcl^ 

TA a3 CUAGCCA " 

ACGA„ CCGTCAGTCCGTCC0TCCCCMCGcTcTcMrG ^ 

° rCCGGCACCTCGTCCACCC ^ 

CATAACAGCGGTCATTGACTGGAGCGAGGCGATGTTCGGGGATTCCCAATACGAMTCGCCAACA 
T"TCTT C rGGAGGCCGTGGTTGGCTT G T A TGGAGCAGCAGACGCGCTAC S££S 

CCGGAGCTTGCAGGATCGCCGCGGCTCCGGGCGrATATGCTCCGCATTGGTCTTGACCAACTCTA 
TCC T,CGGAGCCGGGACTGTCGGGCGTACACAAATCGCCCGCAGAAGCGCGGCCGT^ 

g"ca?Z 

GGCACTGGAATTAATTCTCATGTTTGACAGCrrATGATCCATAAGCTCTAGGTCAATGGTTTGA 

"""TGTWCGTGCGGCCCAGATTCGAATCTGTAATAAAACCTTTTTTTAACT GAA CC GA 

TC.AGACATGATAAGATACAITGATGAGTTTGGACAAACCACAACTAGAATG-A-Tr-iAi.ArC 
GCTTTArTTGTGAMTTTGTGATGCTATTGCTTTATTTGTAACCAT"::::;^^ 
GTTAACAACAACAATTGCATTCATTTTATGTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTA 
AAGCAAGTAAAACCTCTACAAATCAAGCTGATCCCTGCCTCGCGCGTTTCGGTGATGACGGTGAA 
AACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGGGAGCAG 
ACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCGCAGCCATGACCCAGTCAC 
GTAGCGATAGCGGAGTGTATACTGGCTTAACTATGCGGCATCAGAGCAGATTGTACTGAGAGTGC 
ACCATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCC 
GCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTC 
AAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAG 
GCCAGCAAAAGCCCAGGAACCGTAAAAAGGCCGCGTTGCTCGCGTTTTTCCATAGGCTCCGCCCC 

CCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAG 
ATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCG 
GATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTAT 
CTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGA 
CCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCAC 

TGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTG 
AAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCC 
AGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTG 
GTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATC 
TTTTCTACGGGGTCTCACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATT 

ATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTA 
TATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATC 
TGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGG 
CTTACCATCTCGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTAT 
CAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCC 

ATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAA 

CGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCT 
CCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCC 
TTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGC 
ACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAA 

CCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGAT 

AATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAA 
ACTCTCAACGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGAT 
CTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCA 
AAAAAGGGAATAAGCGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATC 


DNA sequence PPOH5-R1 3878 b.p. complete sequence; 

CAGATCTGAGCTTGCCATTGCATACGTTGTATCCATATCATAATATGTACATTTATATTGGCTCA 

TGTCCAACATTACCGCCATGTTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGG 

GTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTG 
GCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCA 
ATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACA 
TCAAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGC 

ATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC 

GCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACG 
GGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGG 
ACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGG 
GAGGTCTATATAAGCAGAGCTCATCTCTTGCTCGGGGTCGCCGTCCTGCACATTGTTGTTGTGAC 

GTGCGGCCCAGATTCGAATCTGTAATAAAACTTTTTTTTTTCTGAATCCTCAGATTGGCAGTGAG 

AGGAGATTTTGTTCGTGGTGTTGCCTGGCCTACTGGGTGGGCGCAGGGATCCGGACTGAATCCGT 
AGTACTTCGGTACAACATTTGGGGGCTCGTCCGGGATACCCTCCCCATCGGCAGAGGTGCCAACT 
GCTTCTTCGAACTTTCTTCGAACTCCGGCGCCGGTGAGTTAAGTACTTGATTTTGGTACCTCGCG 
AGGGTTTGGGAGGATCGGAGTGGTGGCGGGACGCTGCCGGGAAGCTCCACCTCCGCTCAGCAGGG 
GACGCCCTGACCTGAGCTCTGTGGTATCTGATTGTTGTTGAGCCGTCCCTAAGACGGTGATACTA 
AGTCGTGGCTTGTGTGTTTGTTTGTTGCCTTGTGTTTGTTCGTCGTTTGTCGACAGCGCCTTGCG 
AATTGGTGTACCCACACCGCGCGGCTTGCGAATAATACTTTGGAGAGCCTTTTGCCTCCAGTGTC 
TTCCGTCTGTACTCGTCCTCCTCTCCCTCTCCGGCCGCTCTAGAACTAGTGGATCCCCCGGGCTG 
CAGGAATTCGATATCAAGCTTATCGATAAGCTCTAGGTCAATGGTTTGACCAGAACATACAAGAG 
CAGTGGGGAATGTGGGAGGGCCTCGAGGTCGGGGTCGCCGTCCTACACATTGTTGTTGTGACGTG 
CGGCCCAGATTCGAATCTGTAATAAAACCTTTTTTTAACTGAATCCTCAGATTGGCAGTGAGAGG 
AGATTTTGTTCGTGGTGTTGGCTGGCCTACTGGGTGGGCGCAGGGATCGATCCAGACATGATAAG 
ATACATTGATGAGTTTGGACAAACCACAACTAGAATGCAGTGAAAAAAATGCTTTATTTGTGAAA 

TTTGTGATGCTATTGCTTTATTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAACAACAAT 
TGCATTCATTTTATGTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAACCT 
CTACAAATCAAGCTGATCCCTGCCTCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATG 
CAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGGGAGCAGACAAGCCCGTCAGGG 

CGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCGCAGCCATGACCCAGTCACGTAGCGATAGCGGAG 
TGTATACTGGCTTAACTATGCGGCATCAGAGCAGATTGTACTGAGAGTGCACCATATGCGGTGTG 
AAATACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCCGCTTCCTCGCTCACT 
GACTCGCTGCGCTCGGTCCTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACG 
GTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCA 

GGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCAC 
AAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCC 
CCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCT 
TTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAG 
GTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATC 

CGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTG 

GTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAAC 
TACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAA 
AAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCA 
AGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCT 
GACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTCGTCATGAGATTATCAAAAAGGATCTT 
CACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTT 
GGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCA 
TCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCC 
CAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGC 

CAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAAT 

TGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGC 
TGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGAT 
CAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATC 
GTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCT 

TACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAG 

AATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACAT 
AGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTT 
ACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTA 
CTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGG 

GCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAAT 
I claim: 

1. A recombinant retrovirus vector having a U3-free 5' LTR, a 
partially deleted 3' LTR, all essential cis-acting sequences for replication, an 
internal promoter recognizable by a selected host cell, and a non-retroviral gene 
under the control of the recognized promoter wherein: 

(a) the U3-free 5' LTR is positioned 5' of the non-retroviral gene 
and has a transcriptional promoter and enhancer different from that of the original 
retroviral promoter and enhancer replacing the original U3 region of the 5' LTR; 

(b) the partially deleted 3' LTR is positioned 3' of the non-retroviral 
gene and has no U3 sequences except for those required at the attachment site for 
viral integration; 

(c) an exogenous polyadenylation addition signal sequence 
recognized by the selected host cell and positioned on the vector 3' to the 3' LTR 
viral integration site; and 

(d) the internal recognized promoter is positioned adjacent to the 
non-retroviral gene on the vector to permit expression of the non-retroviral gene in 
the host cell; 

whereby the vector can produce progeny virus in a helper cell with 
the progeny virus being capable of infecting the selected host cell and forming a 
provirus in the host cell, with the non-retroviral gene being expressible in the host 
cell, but the provirus in the host cell will be replication incompetent even in the 
presence of a helper virus. 

2. A retrovirus produced by the recombinant retrovirus vector of 
claim 1. 



3. A host cell containing the provirus of claim 1. 